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We present the first exclusive observation of the tt — > hadronic t + jets decay channel. Using these events from 
1.96 TeV pp collisions at CDF, we measure the tt cross section as well as the top quark mass. Events require a 
single hadronic r, large missing transverse energy, and exactly 4 jets of which at least one must be tagged as 
a b jet. The cross section measurement is extracted from a Poisson likelihood function based on the observed 
number of events and the predicted number of signal and background events for a given tt cross section. The 
mass is extracted from a likelihood fit based on per-event probabilities calculated from leading-order signal (tt) 
and background (W+jets) matrix elements. 



1. Introduction 

We present the first exclusive observation of tt — hadronic r + jets events. With these events, we measure 
the production cross section in pp collisions at ^/s = 1.96 TeV with the CDF detector [1] at the Tevatron at 
Fermilab, as well as the first direct measurement of the top quark mass in r + jets events. These measurements 
provide important tests of lepton universality and probe the top quark properties in a relatively unexplored 
channel which may be sensitive to new physics. Additionally, they are good examples of physics measurements 
performed with t leptons in high jet multiplicity environments. 



2. Selection and Background Estimation 

This analysis uses a dataset with a total integrated luminosity of 2.2 fb^^ collected with the CDF detector 
between February 2002 and August 2007. The data is selected using a multi-jet trigger which requires at 
least four jets each with a calorimeter cluster with transverse energy {Et, where transverse refers to being 
perpendicular to the beamline) > 15 GeV and a total sum E-r of all reconstructed jets > 175 GeV. To these 
events, we apply selection criteria which require 4 jets with Et > 20 GeV, missing Et {^t) > 20 GeV, and a 
hadronically decaying r lepton with Et > 25 GeV. Additionally, one of the 4 jets must be identified as coming 
from a b quark (6-tagging) Since our signal process gives a single r lepton, we veto any event with an 
identified electron or muon. Hadronically decaying r's appear as narrow jets with an odd number of charged 
tracks and low n'^ multiplicity. They are selected using similar requirements as described in except we 
require both 1 and 3 prong r's to have visible Et of at least 25 GeV and a visible mass less than 1.8 GeV. We 
also place no explicit requirement on the transverse energy of the tt'^'s in the isolation region, but we do require 
that calorimeter energy in the isolation region be less than 10% of the r energy. 

2.1. Neural Network for QCD Multijets Removal 

The dominant background for this analysis is high jet multiplicity QCD events with one of the jets faking 
the signature of a t lepton. To further reduce the QCD multijets background, we developed an artificial neural 
network (NN) to distinguish between true — > r + jets events and QCD multijets events. First, we create 
a sample of QCD multijets from data by selecting events with a r with no track isolation requirement. The 
NN is trained to distinguish between these selected QCD multijets events and tt events generated with the 
Pythia MC generator |4j where the r decay is handled by the Tauola package [5] to properly account for the 
r polarization. We use 8 variables to train the NN: lead jet Et, sum of the jets and t lepton, sum 
Et of the two lowest jets and the t lepton, sum Et of the two highest Ey jets, transverse momentum of 
the W which decays to a r lepton, average T^-momcnt of all jets not identified as coming from a b quark, and 
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the lowest ratio of dijet mass to trijet mass for any possible triplet of jets. After training the NN, we find it 
provides good separation between QCD multijets and events as can be seen in Fig. [T] and we find the optimal 
signal significance is achieved by removing all events which return a NN output below 0.85. Before the NN 
requirement, we select 162 r + jets events which are largely QCD multijets events as is shown in Fig. [2j After 
the NN selection is applied, we find 41 events of which we expect roughly 18 QCD multijets events and 18 ti 
events. From MC studies, we estimate 76% of the selected tt events are hadronic r + jets decays. The majority 
of the remaining tt events come from all-hadronic tt decays with less than 7% contamination from tt e + 
jets. 
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Figure 1: NN output distribution for signal {tt in red) and background (QCD multijets in blue) events. The NN is 
trained witli tt events given an output value of 1 and background events given an output value of 0. We select events 
with a NN output value > 0.85. 




2.2. Background Estimation 

Due to the difhculty in MC modeling of QCD multijets events, b quark tagging algorithms, and the production 
of heavy flavor quarks in association with W bosons, we use a data-driven approach to estimate the background 
contribution similar to that described in [6 . First, we calculate the contributions from electroweak background 
processes which have a minimal contribution to the final total (diboson, single top quark production, and Z + 
jets events), as well as the tt signal contribution, by using the theoretical cross section for each process along 
with the acceptance from MC simulation and the total integrated luminosity. 

With these contributions known, we evaluate the contributions for QCD multijets and W + jets events by 
fitting the shape of the NN output distribution for each component (including the fixed contributions already 
calculated) to the data before the NN selection and 5-tagging requirements are applied. We fit these distributions 
using a binned Poisson likelihood. From this fit, we evaluate the percentage of the data events above the 0.85 
NN output value which are coming from QCD multijets. Any remaining events are assumed to come from W 
+ jets processes. 

We next apply 6-tagging efficiencies to sources except the data-based QCD multijet events to estimate the 
resulting contribution from each source after the 6-tagging requirement is applied. The W + jets events are 
divided up into contributions from W + light flavor and W + heavy flavor {W + bb, W + cc, and W + c). We 
then fit the resulting NN output shapes along with the QCD multijet NN output shape to the data after the 
6-tagging requirement is applied to calculate the contribution from QCD multijet events. The fits before and 
after the 6-tagging requirement are shown in Fig. [2] The contribution for each process assuming a top pair 
cross section of 7.4 pb and a top quark mass of 172.5 GeV is shown in Tab. |l| 

3. Top Anti-Top Production Cross Section Measurement 

Generally, cross sections are measured as: 
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Figure 2: Fit to tiie NN output shape beofre (top) and after (bottom) applying the 6-tagging requirement. The percent- 
ages listed in the legend are the percentage of events from each source contributing to the data sample after applying 
the NN selection requirement (shown with an arrow). 



Source 


Number of Events 


Diboson 


0.19 ±0.01 


Single top 


0.16 ±0.01 


Zhb 


0.29 ±0.04 


Whb 


0.57 ±0.47 


Wcc 


0.34 ±0.28 


Wc 


0.15 ±0.13 


W+lf 


0.46 ±0.60 


QCD multijets 


18.24 ±4.10 


Total Bkgd 


20.40 ±4.18 


Top 


18.17 ± 2.79 


Total Predicted 


38.57 ±5.05 


Observed 


41 



Table I: Predicted number of selected r ± jet events from each considered process after applying a NN selection value 
of 0.85 assuming a tt pair production cross section of 7.4 pb and top quark mass of 172.5 GeV. We expect roughly 
39 events compared to the observed 41 with nearly half of the events coming from — >■ t ± jets and half from QCD 
multijet production. The uncertainties given are a combination of statistical uncertainties and the selection efficiency 
uncertainties. The QCD multijet uncertainty includes the systematic uncertainty on the fraction of QCD multijet events. 
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where Ndata and N^kgd are the number of events observed in the data and the number of predicted background 
events, respectively. The kinematic acceptance for the process being observed (for the case of au, we measure 
here the acceptance for pp tt ^ t + jets is Acc, e is the product of all geometrical and kinematic event 
selection efficiencies corrected for by data/MC scale factors (SF) when relevant, and C is the total integrated 
luminosity of the data. 



However, in this analysis, N^kgd is a function of the tf cross section ((Jtf), as described in Sec. 2.2 and as a 
result, we cannot simply use Eq. [l]to measure CTfj. Instead, we build a likelihood function based on the Poisson 
probability distribution comparing the number of observed events and the predicted number of events. We then 
minimize the negative log of this likelihood function written as: 

-2-lnL = -2- (Ndata ■ In [ati ■ D + Nb{<Jti)) - In [NdatJ-) - [<Jti ■ D + Nb{att))) , (2) 

where D is the denominator of Eq. [ijand Njj^att) is the number of events from the background prediction for 
a given ati- The result, shown in Fig. [sj is fit with a 2"'^ order polynomial which is used to extract the central 
value and statistical uncertainty. We find the cross section to be 8.7 ± 3.3 (stat.) pb. 



3.1. Top Pair Production Cross Section Systematic Uncertainties and Result 

To measure the systematic uncertainty on the Uti measurement we consider effects on the acceptance, selection 
efficiencies, background estimate, and luminosity. 

For acceptance effects, we consider uncertainties on the jet energy scale (JES), initial and final state radiation 
(ISR, FSR), color reconnection, parton showering, and parton distribution functions (PDF). The jet energy 
measured by the calorimeter is subject to several correction functions each with an associated systematic 
uncertainty [7j. To measure this uncertainty on the cross section, we shift the JES accordingly in the MC 
and re-measure the cross section. Changes in the amount of initial and final state radiation would change our 
acceptance, therefore, we model this effect using pythia MC models with increased and decreased radiation [S]. 
Similarly, we consider acceptance shifts from using models with and without color reconnection effects [S]- We 
consider a 6% systematic uncertainty for differences in parton showering models from different MC generators. 
This number is taken from the difference in tt — >■ r + jets acceptance from MC generated with Pythia and 
Herwig [TU] after requiring the selected r to be matched to a generated t in the MC. This requirement is 
applied because we find that Herwig jets fake r's at a rate higher than Pythia jets, and our studies show that 
Pythia better models the observed r fakes in the data. Finally, we consider changes in acceptance by varying 
the eigenvectors of CTEQ6M [IT] PDF's. 

We consider systematic uncertainties on the efficiency measurements on the 6-tagging, mistag matrix, lepton 
identification, and trigger. Each of these uncertainties are evaluated by re-measuring the cross section with the 
appropriate efficiency or scale factor adjusted by its systematic uncertainty. Due to inefficiencies in modeling 
6-tagging in the MC, we use a tagging scale factor [12] on MC jets matched to heavy flavor to account for the 
^-tagging requirement. Similarly, due to the poor modeling of mistags in MC, we use a data-based parame- 
terization to model the mistagging of MC jets from light flavor [T^]. We consider similar shifts for the lepton 
identification ^13] and trigger efficiency [H] scale factors. 

The background systematics come from the W + heavy fiavor "K-factor" uncertainty and the QCD multijets 
contribution. For the number of events predicted for W -t- heavy fiavor processes, a data/MC scale factor 
called the "K-factor" is used to correct for the fraction of -I- heavy flavor events observed in the MC [6] . To 
measure the uncertainty from the K-factor, we shift it within its errors and take the difference in the cross section 
measurement as the uncertainty. Since QCD multijets events make up nearly 50% of our accepted events, the 
contribution from QCD multijets events is our largest systematic uncertainty. To measure this uncertainty, we 
select QCD multijets events from the data without the and NN selection requirements. We then compare 
the NN output distribution of these events to that of data events with the same requirements removed both 
before and after the 5-tagging requirement. The selection of data events without a requirement is dominated 
by QCD multijets events below a NN value of 0.7, so we can fit the comparison between these distributions 
with a function to build a reweighting scheme for the QCD multijet distribution. By shifting this fit function 
within its uncertainty, we define a la uncertainty on the shape of the QCD multijets distribution. We then 
reweight the QCD multijets events and re-measure the cross section to measure the systematic uncertainty from 
the multijets contribution. 

Finally, we consider a 6% uncertainty on the luminosity measurement from the detector accuracy and the 
uncertainty on the theoretical cross section for inelastic pp collisions 115] . 

A summary of the contribution from each systematic source is given in Tab. jll] 
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Systematic 


5a (pb) 


5a /a (%) 


Jet Energy Scale 


0.6 


6.9 


IFSR 


0.5 


5.7 


Color Reconnection 


0.4 


4.6 


Tagging 


0.4 


4.6 


Mistag Matrix 


0.1 


1.1 


QCD Fraction 


1.8 


20.5 


K-Factor 


0.1 


1.1 


Parton Showering 


0.5 


6.0 


Lepton ID 


0.2 


2.3 


Trigger Efficiency 


0.1 


1.1 


PDF 


0.5 


5.7 


Luminosity 


0.5 


6.0 


Total 


2.2 


25.0 



Table II: Systematic uncertainties for the tt cross section measurement in the r + jets decay channel. The uncertainties 
are given as well as the fractional uncertainty. 



We measure a^^ assuming a top quark mass of 172.5 GeV to be 8.7 ± 3.3 (stat.) ± 2.2 (syst.) pb which is in 
good agreement with the most recent CDF combination of 7.5 ± 0.5 pb pp] . 
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Figure 3: The function —2 • InL versus input a^ as defined in Eq. |2] The solid line is the fit of a second order polynomial 
used to extract the central value and statistical uncertainty. 



4. Top Quark Mass Measurement 

The top quark mass measurement is a Matrix Element style analysis. The mass is extracted from a likelihood 
function based on signal and background probabilities for each event. These probabilities, described in Sec. 



4.2 are calculated from the differential cross section for tt and W + A parton production. The t lepton decay 
adds extra complication to the mass measurement because its decay introduces a second v into the event. To 
account for this, we develop a new method to reconstruct the v from the r decay which allows us to reconstruct 
the original r lepton which is described next. 



4.1. Collinear u Approximation 



The missing energy from the u from the r lepton decay complicates the measurement of the top quark mass 
in the hadronic r + jets decay channel. For this analysis, we developed a new method to reconstruct this i/'s 
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4-momentum which in turn allows us to reconstruct the 4-monientum of the r lepton before it decays. We 
assume that the v from the r decay is nearly collinear with the hadronic components of the r decay within 0.1 
radians in 9 and (j). Additionally, we assume that the (/) angle of the v from the W boson is within 1 radian 
of the These assumptions come from studying MC simulation of if — ^ t + jets events. We introduce 

a 4 dimensional scan across the angles of both v's. Assuming the v mass is negligible and that the W and 
T have masses of 80.4 GeV and 1.8 GeV, respectively, we can completely solve for the 4- vector of each v for 
any set of angles from the scan. We then use the v solutions to predict the in the event and compare it 
to the event's measured with a Gaussian probability function. We chose the set of angles which returns 
the greatest probability based on the comparison. This method accurately reconstructs the 4-momentum 
of the V from the r lepton decay, but it does not perform as well with the 4-momentum of the v from the W 
decay. We use the result to reconstruct the original r lepton in the event. Meanwhile, the v from the W decay 
is reconstructed in the Matrix Element method as described in 16] and below. 



4.2. Top Quark Mass Likelihood Function 

The top quark mass {Mtop) measurement is derived from a likelihood function based on signal and background 
probabilities for each event. The method uses a similar approach as a previous measurement in the electron 
and muon -I- jets decay channels |16j . The signal probability is based on a tt leading-order matrix element 
which assumes qq production P7| and is calculated over 31 input mass values ranging from 145 to 205 GeV for 
each event. The background probability for each event is calculated with a W + jets matrix element from the 
VECBOS [H] generator. Since there is no Mtop dependence in the background probability, it is calculated only 
once for each event. 

To improve the statistical uncertainty on the Mtop measurement, we add a Gaussian constraint on the 
background fraction (1 — c^) to the likelihood function. The background fraction is constrained to be 0.498 ± 
0.106 from Tab. HI The likelihood function is calculated as: 

where P is a combination of the signal (Psig) and background (Pbkgd) probabilites with a relative normalization 
term (Abkgd)- 

P = CsPsig {x; Mtop) + Abkgd (1 ^ C^) Pbkgd (x) ■ (4) 

The signal and background probability are both calculated by integrating over the differential cross section for 
the appropriate process: 

Psig/bkgd= / dasig/bkgd{y)f{qi)f{q2)W{x,y)dqidq2, (5) 

^sig/bkgd J 

where da is the differential cross section, / is the parton distribution function (PDF) for a quark with momentum 
fraction of the incident proton q, x refers to detector measured quantities, y refers to parton level quantities, 
and W{x,y) is the transfer function used to map x to y. After calculating the probabilities for each event, we 
evaluate a likelihood function for each of the 31 input top quark masses and fit the result with a second order 
polynomial to derive Mtop and its statistical uncertainty. We calibrate the measurement on 21 MC samples 
covering a mass range of 155 GeV to 195 GeV. The likelihood function and fit for the data can be seen in Fig. 
|4] We measure Mtop to be 172.7 ± 9.3 (stat.) GeV. 



4.3. Top Quark Mass Measurement Systematic Uncertainties and Result 

We consider 12 different sources of systematic uncertainty for the Mtop measurement. The largest uncertainty 



is from the JES (mentioned in Sec. 3.1). For this uncertainty, we shift the jet energies up and down by each 
correction's uncertainty and sum in quadrature the systematic uncertainty measured from each correction. Since 
the top quark mass is very sensitive to the energy of its daughter particles, the JES uncertainty is the dominant 
uncertainty for this measurement. 

We also consider systematic uncertainties from the differences in parton showering models from different 
MC generators, ISR and FSR, and color reconnection by performing the measurement with MC models which 
account for each effect. The background fraction uncertainty is measured by re-performing the measurement 
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Figure 4: Negative log likelihood (from Eq. |3| as a function of top quark mass for all data events. The calibration 
functions have not yet been applied. 



with pseudoexperiments where the fraction of the background contribution from the QCD multijets background 
and each of the W + jets backgrounds is shifted within its uncertainty from Tab. [ij The uncertainty from each 
background is then added in quadrature to measure the total background fraction uncertainty. 

To measure the uncertainties from PDFs, we measure the shift from the different CTEQ6 eigenvector PDFs. 
The uncertainty from the fraction of gluon-gluon fusion produced tt events is evaluated by reweighting the MC 
events so that the percentage of tt events which result from gluon-gluon fusion is shifted from 5% to 20%. 

The 6-jet uncertainty accounts for different fragmentation models and semileptonic branching ratios for jets 
from b quarks |19) . These uncertainties are added in quadrature to an uncertainty measured from shifting the 
energy scale of jets from h quarks to get the total 6-jet uncertainty. We also account for shifts from the lepton 
energy scale by considering changes in the measurement from MC samples with shifted r energy. 

The pileup systematic uncertainty accounts for a known mismodeling in the luminosity profile of the MC. 
To evaluate this, we measure the shift in the measurement from MC events which are reweighted to give the 
luminosity profile which is seen in the data. 

Finally, we consider systematic uncertainties from our calibrations. We shift the calibration function within 
its uncertainty to measure the calibration systematic uncertainty. Even after the calibration function is applied, 
we find a 0.14 GeV uncertainty on the fit of the mass residual (defined as the true mass substracted from the 
measured mass) across all 31 mass points. Due to this, we take a 0.14 GeVsystematic uncertainty for MC 
statistics. 

The full table of systematic uncertainties for the top quark mass measurement can be found in Tab. |III| 

Having evaluated all uncertainties, we measure Mtop to be 172.7 ± 9.3 (stat.) ± 3.7 (syst.) GeV which agrees 
with the most recent Tevatron combination of 173.2 ± 0.9 GeV [21] . 



5. Conclusion 



We use the r -I- jets decay channel to identify tt events as well as measure the top quark properties with 

2.2 fb~^ of data. We find the ti pair production cross section to be 8.8 ± 3.3 (stat.) ± 2.2 (syst. -|- lumi.) pb. 
We also measure the top quark mass in this decay channel for the first time ever and find it to be 172.7 ± 

9.3 (stat.) ±3.7 (syst.) GeV. We find the measurements to be consistent with the current CDF combination top 
pair production cross section of 7.5 ± 0.5 pb j20^ and the Summer 2011 Tevatron top quark mass combination 
of 173.2 ± 0.9 GeV [21 . The values we measure with r leptons agree with current measurements, therefore, we 
find no evidence against lepton universality. Additionally, the success of these measurements demonstrates that 
we can do complicated analyses with r leptons in high jet multiplicity environments at hadron colliders. 
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Source 


Uncertainty (GeV) 


JES 


3.37 


MC Generator 


0.50 


ISR/FSR 


0.34 


Color Reconnection 


0.50 


Background Fraction 


0.47 


MC Statistics 


0.14 


PDF 


0.12 


gg fusion 


0.17 


B-jet 


0.39 


Lepton pt 


0.19 


Pileup 


0.95 


Calibration 


0.17 


Total 


3.7 



Table III: Total systematic uncertainties on Mtop for the t + jets channel. 
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